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Abstract 

We consider local Markov chain Monte-Carlo algorithms for sampling from the weighted distribution of 
independent sets with activity A, where the weight of an independent set / is A' J L A recent result has estab- 
lished that Gibbs sampling is rapidly mixing in sampling the distribution for graphs of maximum degree d 
and A < X c (d), where A c (g0 is the critical activity for uniqueness of the Gibbs measure (i.e., for decay of 
correlations with distance in the weighted distribution over independent sets) on the d-regular infinite tree. 

We show that for d > 3, A just above A c (d) with high probability over d-regular bipartite graphs, any local 
Markov chain Monte-Carlo algorithm takes exponential time before getting close to the stationary distribu- 
tion. 

Our results provide a rigorous justification for "replica" method heuristics. These heuristics were invented 
in theoretical physics and are used in order to derive predictions on Gibbs measures on random graphs in 
terms of Gibbs measures on trees. A major theoretical challenge in recent years is to provide rigorous proofs 
for the correctness of such predictions. Our results establish such rigorous proofs for the case of hard-core 
model on bipartite graphs. 

We conjecture that A c is in fact the exact threshold for this computational problem, i.e., that for A > A c 
it is NP-hard to approximate the above weighted sum over independent sets to within a factor polynomial in 
the size of the graph. 



1 Introduction 

1.1 Sampling weighted independent sets 

Approximately counting (or sampling) weighted independent sets is a central problem in computational com- 
plexity, in statistical physics, where lattice gases are modelled, and in communication networks. Typically the 
weights of the independent sets are governed by an activity parameter A so that the weight of an independent 
set I is proportional to A' 7 '. 

Intuitively, counting or sampling independent sets becomes harder as A increases. Indeed, if we could 
sample an independent set from the distribution resulting from a large enough setting of A then we would have 
an algorithm for finding the maximum independent set of a graph, an NP-hard problem. 
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In this paper we restrict our attention to local Markov chain Monte Carlo methods for sampling weighted 
independent sets. A recent paper of the second author lUTTll established the that Gibbs sampling, a simple local 
Markov chain Monte Carlo algorithm is rapidly mixing for graphs of maximum degree d if A < X c (d) where 
X c (d) = (d — l) d ~ 1 /(d — 2) d is the threshold for decay of correlations on the regular tree of degree d. In this 
paper we show that X c (d) is in fact the exact threshold for rapid mixing of local dynamics, by showing that for 
random bipartite <i-regular graphs, with high probability, the mixing time of local dynamics is exponential in 
the size of the graph if A > X c (d). 

In the following subsections we state our main result and provide motivations for studying this problem 
from computational complexity, the replica method and the role of uniqueness and extremality 

1.2 Our results 

In our main result we show that for d > 3 and A just above X c (d) for almost all bipartite d-regular graphs, any 
local MCMC must take exponential time to get close to the stationary distribution. More formally we show the 
following. 

Theorem 1.1 For all d > 3 there exists > such that for X c (d) < X < X c (d) + there exists a = a(X, d) > 1 
such that with high probability (probability tending to 1 as n — > 00) for a random d-regular bipartite graph, 
the mixing time of the Glauber dynamics on G is £l{a n ). Moreover, the same claim holds true for any reversible 
dynamics which updates at most o(n) nodes at each iteration. 

1.3 Computational complexity of sampling weighted independent sets 

Based on the intuition that sampling independent sets is computatioally harder for larger values of A, it was 
shown Il2! that for any d > 4, it is NP-hard to approximate the above weighted sum over independent sets, 
even to within a polynomial factor, for graphs of maximum degree d and A > §, where c is a (large enough) 
absolute constant. 

On the other hand, the existence of a fully polynomial approximation scheme has been established for an 
ever- improving sequence Ij9[ 151 l20l I2T11 of bounds on A which are also inverse linear in the maximum degree. 
It is a fascinating challenge to determine the exact threshold (in terms of A as a function of d~) for which the 
counting is hard to approximate. (It is known that for any given (d, A), either there exists a fully polynomial 
approximation scheme or it is NP-hard to approximate the sum over independent sets to within a polynomial 
factor G8]]). 

It has been speculated (though not formally conjectured) that the hardness threshold corresponds to the 
threshold for decay of spatial correlations for the weighted distribution over independent sets (uniqueness of 
the Gibbs measure). In particular, it is was speculated that the critical activity X c (d) = (d — l) d ~ 1 /(d — 2) d for 
decay of correlations on the regular tree of degree d is also the threshold for the computational problem. This 
is supported by a recent paper of the second author [21] which established the existence of a fully polynomial 
approximation scheme for counting independent sets for every d and A < X c (d). The present paper is motivated 
by this speculation. In particular, we conjecture the following. 

Conjecture 1.2 For every d > 3 and all X > X c (d), unless RP = NP there does not exist a fully polynomial 
approximation scheme for counting weighted independent sets of graphs of maximum degree d with activity X. 

We provide evidence supporting the above conjecture by analyzing local Markov chain Monte Carlo algorithms 
for sampling independent sets. By local algorithms we refer to algorithms in which the number of vertices 
updated in a single step of the chain is o(n), where n is the number of vertices of the graph. 
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Of course, slow mixing of local MCMC algorithms as established in this paper does not generally imply the 
hardness result stated in the conjecture. For example, the Ising model at low temperatures is an example where 
the local MCMC algorithm mixes slowly while there is an FPRAS [6] for computing the partition function. 

However, we believe that our results do give support for the conjecture. First, unlike the Ising model where 
there is no computational phase transition and an FPRAS exits for all temperatures [6], we know (as already 
mentioned above) that approximate counting of independent sets is computationally hard for A > | for a 
large enough absolute constant c. Second, our result establishes that for A just above \ c (d) and most d- regular 
bipartite graphs balanced independent sets form an exponentially small bottleneck, i.e., the density of a typical 
independent set in these graphs has a positive bias to either V\ or V2 except with exponentially small probability. 
The availability of such graphs could open the way for hardness constructions in which these graphs would be 
used as gadgets for, e.g., encoding a binary variable, and thus that the hardness threshold for approximate 
counting of independent sets coincides with the threshold for the availability of such graphs. See |0| for an 
easier construction of a similar flavor that was used to establish that approximate counting of independent sets 
with A = 1 is hard for d > 24. 

1.4 Our result and the Replica method 

Another important motivation for our result is the "replica" heuristic developed in theoretical physics IfTTl [T2ll . 
This method gives predictions on the behavior of Gibbs measures on random graphs that are based on the 
analysis of Gibbs measures on trees. The method has been extensively used to derive predictions of the behavior 
of many random systems Ifl3ll . The method has also yielded an empirically effective algorithm for solving 
random 3-SAT problems at the highest known densities Ifl4ll . 

The theoretical study of the replica method has been a major challenge in mathematics, theoretical physics, 
probability, engineering and computer science. A number of results proved the validity of the method for 
various specific models such as the SK model |fl~9ll , the assignment problem [1] and some results in coding 
theory |Tl7lfT0ll . However, all of these results deal with specific problems. No general results are known for the 
applicability of the method. 

Our results provide another example where it can be rigorously shown that replica calculations do de- 
termine the behavior of Gibbs measures on random graphs. In particular, our results are strong enough to 
actually determine the dynamics properties of the Gibbs measures, i.e., the convergence time of reversible local 
dynamics. 

1.5 Uniqueness, Extremality and Slow Mixing 

Another important aspect of our results is that establishing exponentially slow mixing for random d-regular 
graphs for A just above A c shows that these graphs, while locally similar to the ci-regular tree, behave in a very 
different manner than the tree w.r.t. the mixing time of the chain for A > A c . On the regular tree, the mixing 
time of the local Markov chain is 0(n log n) even above A c . More precisely, there exists a second threshold lfT5H 
Ai > A c such that the mixing time is 0(n log n) for X < X\. (On the tree the mixing time is polynomial in n for 
any A |]2]|.) This corresponds to the fact that the Gibbs measure is extremal for A < A c (see also I131D. 

Our results show that while random regular graphs are locally tree-like, they have very different properties 
when it comes to convergence of local dynamics. In particular, while the threshold for uniqueness of the 
Gibbs measure (appropriately defined) is the same in random graphs and the tree, in random graphs — unlike 
the tree — this threshold is also where the Gibbs measure ceases to be extremal and where the mixing time 
undergoes a sharp transition and becomes exponentially slow. This is in line with the intuition that the mixing 
time depends more crucially on the size of the separators of the graph than on their local structure. Trees have 
very small separators (e.g., the root) and thus have fast mixing time even above A c . Random graphs on the 
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other hand are expanders, and indeed have exponentially slow mixing already just above A c . 
1 . 6 Proof Technique 

Our proof borrows its initial approach from the proof in [4] where the case A = 1 is analyzed. It is shown 
in Hi that most random <i-regular bipartite graphs exhibit an exponentially small bottleneck for d > 6, causing 
the Markov chain to take exponential time to mix. Note that X c (d) < A c (6) « 0.763 < 1 for every d > 6 so even 
though the latter result is tight for A = 1 in terms of the degree d, it is far from tight for general A as a function 
of d. 

In this paper we show that the model suggested in 01 can be in fact be analyzed all the way down to \ c (d) 
for d > 3. We note that the arguments of [4] do not extend to give a proof of slow mixing all the way down 
to X c {d): in that case, where a lower bound on the number of balanced independent sets of a given size and 
location was required, it sufficed to use a crude lower bound that applied for all graphs. This was made easier 
because the balanced independent sets were far from the typical ones (in terms of their "balancedness"). For 
our present purposes, we need to consider cases where the typical independent sets are quite close to being 
balanced. As a result, we found it necessary to use some sophisticated second moment calculations. This is 
even just to show the existence of at least one graph with an exponentially small bottleneck of the type we 
desire. To show that almost all graphs have such a bottleneck we used the small subgraph conditioning method 
of the third author and Robinson (see H22I0 . 

These apply for all A slightly above X c (d). We believe that the same result holds for for all d and A > X c (d). 
Also, we would like to point out that our first-moment analysis, while similar to J4]|, exhibits more explicitly 
the role of Gibbs measures on the regular tree. 

Acknowledgments: E.M. and D.W. wish to thank Alistair Sinclair for interesting discussions. 

2 Preliminaries and statements of results 
2.1 The random graph model 

We consider the following model for random graphs. The graphs are all bipartite with vertex sets V\, V2 of size n 
each. We choose d random perfect matchings between the two vertex sets so that every vertex has degree d. 
Note that there maybe parallel edges (with probability asymptotic to a constant less than 1). However, for the 
hard core model, a parallel edge has exactly the same effect as does a single edge. So this can alternatively 
be regarded as a model of simple graphs in which the degree of each node is at most d. We denote the above 
distribution over graphs by Q = Q(n, d). As is common in the theory of random graphs, we will use the term 
asymptotically almost surely (a.a.s.) to refer to a sqequence of probabilities converging to 1 as n — > 00. 

Remark: The probability space Q(n, d) we are working with is the set of bipartite (multi)graphs obtained by taking d 
random perfect matchings between two sets V\ and V2 of n vertices each. This probability space is contiguous with a 
uniformly random d-regular graph (see the note after the proof of [16, Theorem 4]), and hence, all of the results below 
proven a.a.s. for Q(n,d) consequently also hold a.a.s. in the uniform model, with the uniform distribution over simple 
bipartite e?-regular graphs (as well as various other models contiguous to it). □ 



2.2 The hard-core model 

Let A > be an activity parameter and G a finite graph. Denote by Zq the set of independent sets of G. The 
hard-core distribution on G with activity A, denoted /ig,a, is the distribution over Zq in which the probability 
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of an independent set / G Xq is proportional to A^, i.e., 



Mg,a [I] 



(1) 



Zg,x 



where Z GiA = £ 



2.3 Gibbs measures on the regular tree 

Let T d be the infinite regular tree of degree d. A probability measure \i on independent sets of T d is Gibbs if 
for every finite subtree T, conditioning fi upon the event that all the vertices on the outer boundary of T are 
unoccupied gives the same probability distribution on independent sets of T as defined by (Q]) with G = T. 
Moreover, \i is a simple Gibbs measure on T d if for any vertex v, conditioning fj, on any of the two possible values 
at v results in a measure in which the configurations on the d (infinite) subtrees rooted at the children of v 
are independent of each other. (Notice that the probability distribution on a finite subtree is always simple.) 
A translation-invariant Gibbs measure on T d is a measure that is invariant under all automorphisms of T d . 
Finally, a semi-translation-invariant Gibbs measure on T b is one that is invariant under all parity-preserving 
automorphisms of T d . 

It is well known [8] that the hard-core model on T d admits a unique simple translation invariant Gibbs 
measure for all values of A. For A < X c {d) = (d — l) d ^ 1 /(d — 2) d this measure is the unique Gibbs measure of 
any kind. However, for A > A c there are additional Gibbs measures, and in particular, two additional simple 
semi-invariant measures in which the vertices of one parity are more likely to be occupied than the vertices of 
the other parity. 

2.4 The Glauber dynamics and other local dynamics 

Even though our results apply to any Markov chain Monte-Carlo algorithm that updates at most o(n) vertices 
in one step, for convenience and definiteness, we will first discuss a well-known and simple Markov chain for 
sampling weighted independent sets, called the Glauber dynamics. This chain is defined as follows. Starting 
from the current independent set /, choose u.a.r. a vertex v from V1UV2. If all the neighbors of v are unoccupied, 
set v to be occupied with probability A/(A + 1) and otherwise set v to be unoccupied. It is easy to verify that 
this Markov chain converges to the hard-core distribution ^g,x- 

More generally, we will consider dynamics where at each stage a (random) set of vertices W is chosen 
according to some fixed distribution. Then the configuration of the vertices at W is sampled according to the 
conditional probability at (Vi U V2) \ W. We will only consider cases where the sets W chosen are of size o(n). 
It is easy to see under mild conditions that this dynamics also converges to the hard-core distribution /xg,a- 

The main question for both dynamics concerns how many steps it takes the chain to get sufficiently close 
to this stationary distribution. The mixing time of the chain is defined as the number of steps needed in order 
to guarantee that the chain, starting from an arbitrary state, is within total variation distance l/2e from the 
stationary distribution. 

Our method for establishing slow mixing of the Glauber dynamics is based on conductance type arguments. 
Namely, in order to prove slow mixing we will show the existence of A c T whose measure is at most \ 
and whose boundary is exponentially smaller, i.e., the probability of escaping A is exponentially small. The 
existence of such a subset is well-known to imply slow mixing of the Markov chain. For example, the following 
is taken from [4, Claim 2.3]. 

Claim 2.1 Let M be a Markov chain with state space $7, transition matrix P, and stationary distribution /j,. Let 
A dVLbe a set of states such that fi[A] < \, and B c £1 be a set of states that form a "barrier" in the sense Pij = 
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whenever i G A\B and j e A c \ B. Then the mixing time of M is at least ^gj- 
2.5 Main result 

We show that for d > 3 and A just above A c , a.a.s. the mixing time of a random graph drawn from Q(n,d) 
is exponential in n. As explained above, this is done by establishing an exponentially small bottleneck in the 
state space. For a given graph G, let X\ = {I e Xg \ \I fl V\\ > \I D V2W, define X2 similarly, and let Xb = 
{I e Xg I \I (~l V\\ = \I n V2I}. Notice that Xb forms a barrier between 2i and J 2 , i.e., the Markov chain must go 
through Xb in order to cross from X\ to J2 and vice versa. 

Similarly, given r > we define X[ = {I G Xq \ \I n Vi| > \I n V2I + rn}, define TJ similarly, and let 2]j = 
{J e Xg I |-f n Vi I — \I D V2I < rn}. Now T]j forms a barrier between JJ" and TJ for local Markov chains. In 
other words, for any reversible chain that updates at most rn vertices at an iteration, the Markov chain must 
go through X T B in order to cross from X\ to 2J and vice versa. 

Our main result establishes that for a random graph from Q, Xb is an exponentially small bottleneck. 
Theorem 2.2 There exists function a(d) and defined for d > 3 and satisfying 

• a(d) > 1 for all d>3. 

• e d > for all d>3. 

such that for all \ c (d) < A < X c (d) + ed there exists a = a(X,d) > 1,5 = S(X,d) > such that a.a.s., a graph 
drawnfrom G(n,d) satisfies [i\Xb\ < a~ n mm{/i[Ji],/4J 2 ]}. Moreover < a~ n min |/i[2"f], /i[T|]}. 

Since min{/x[Ti], /xpVI} < f> applying Claim [ZT] gives the following. 

Corollary 2.3 For all d > 3 there exists > such that for X c (d) < X < X c (d) + e d there exists a = a(X) > 1 
such that a.a.s. for a graph G drawnfrom G(n,d), the mixing time of the Glauber dynamics on G is 0(a"). The 
same claim holds true for any reversible dynamics which updates at most o(n) nodes at each iteration. 

In order to establish the above result we needed to resort to certain detailed calculations which we only carried 
out for A close to A c . However, we believe the same result holds for every d > 3 and any A > A c (<i). 

Conjecture 2.4 For every d > 3 and any X > X c (d) there exists a = a(d, A) > 1 such that a.a.s. on Q(n, d) it holds 
that /j,[Xb] < a~ n min{/u[Xi],^[J2]}. 

3 Proof of the main theorem 

In this section we prove Theorem 12.21 which states that for A > A c , the set of balanced independent sets forms 
an exponentially small bottleneck. The analysis proceeds by calculating for any given pair of densities (a, (3) 
the weight of independent sets that occupy an and j3n vertices of V\ and V%, respectively. Naturally we assume 
that an and f3n are both integers. Let X^ = {I e X | / n V± = an, I n V 2 = pn} and = E /eI ^ X^ a+ ^ n . 

Below we use O(-) and O(-) to denote O(-) and O(-) up to factors that are polynomial in n. 

Our first step is analyzing the expected weight (over graphs) of the different possible densities. In what 
follows, H(x) is the entropy function w.r.t. natural logarithms, i.e., H(x) = — xln(x) — (1 — x)ln(l — x). The 
following proposition is proved in Section 0] 
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Proposition 3.1 Fix d and A. For any a, ft € [0, 1], 
where 

*l(a,/3) = (a + /3)ln(A) + H(a)+H(/3)+d-^ 1 (a,/3), 

and 

*i = (l-^HC^J-HCa). 

The analytic properties of <&i in the triangle 

T = {(a, /?) | a, (5 > and a + /? < 1} 
play an important role similarly to Q where it was needed for the case A = 1. 

The analysis for general A is essentially the same. However, our analysis demonstrates explicitly the role 
of Gibbs measures for the hard-core model on the infinite d-regular tree. In Section|4|we derive the following 
lemma. 

Lemma 3.2 The following holds: 

• The maximum of &i over T along the line a = (3 is achieved at a = (3 = p*, where p* is the probability that 
any given vertex is occupied in the unique simple translation-invariant Gibbs measure on T d . 

• If A < \ c (d) then a = f3 = p* is also the unique maximum o/<3?i over the whole ofT. 

• If A > A c (d) then a = (3 = p* is a saddle point and there exist p\ < p* < p2 for which the only two maxima 
of $1 over T are at (a = pi,/3 = P2) and (a = P2,(3 = pi). Furthermore, pi,p% are the probabilities 
of occupancy of even and odd vertices, respectively, in the two additional simple semi-translation-invariant 
Gibbs measures on T d . 

• p*,pi and p2 all vary continuously with A. Moreover, p\ — p* , p2 — p* — > as A — > A c from above. 

Note that if 3>i(a, f3) was the exponent of the weight of (a,/3)-sets in a specific graph and not only as 
expected value then this graph would exhibit the bottleneck as stated in Theorem l2.2i Indeed, suppose that for 
A > A c (d) there exists a graph G with \V\\ = |V^| = n and maximum degree d for which = Q( e ^^) n ). 
It would then follow that fx[l B }/^[li] < nZ* 1 '*' /Z^ = n ■ Q{e n ^^ 'J^)-*a(parf»i)]) = Q( a ~ n ), where a = 
e 0i(p*,p*)-0 2 (p 2 ,Pi) > i. Similarly it would also follow that fi[l B }/ fi[l 2 ] < @(a,- n ), and the combination of the 
two inequalities would establish the existence of the desired bottleneck. 

Thus, if we could establish concentration for the random variables Z a ^, then Theorem 12.21 would follow. 
Note that since these variables are exponentially large, upper bounds are easily derived using Markov's inequal- 
ity. Specifically, since the expected value of the total weight of balanced sets is at most n ■ 0(e* 1 ( p * ,p *) n ) then 
for most graphs in Q(n, d) the total weight of the balanced sets is 0{e^ 1 ^ p * ,p ' n ). 

If we could additionally show that for A > X c (d) and for most graphs G in G(n,d), both Zq ,V2 and Z^' P1 
are r2(e* 1 ^ 1 ' P2 ^ ?1 ), i.e., these variables are within subexponential factors from their expected values, then the 
proof would follow. 

It is clear that such concentration cannot hold for general (a, ff). For example if d is large and a = f3 = 1/2 
then Zq will be except with exponentially small probability while its expected value will still be exponen- 
tially large. 

However, we conjecture that concentration does hold for some values of (a, ft), including the relevant 
values (pi,P2)- In particular, we show that for d > 3, this concentration holds in the neighborhood of the point 
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a = (3 = 1/d. The proof of the required concentration follows in two stages which we first describe somewhat 
roughly and then state them more formally. 

• First we show that for values of a and f3 close to 1/d, the second moment of Zq 13 is of the same order as 
the square of the first moment of Zq. In fact, we calculate the limiting ratio as n — > oo. Using the exact 
calculation and the second moment method actually allows one to obtain weaker version of Theorem l2.2[ 
where the result is obtained with positive probability, say probability at least 1/2, instead of a.a.s. 

• To obtain a high probability result we use the small graph conditioning method. This method "explains" 
the variance of Zq 13 by the interaction between the numbers of short cycles and the random variable Zq . 
For background on the conditioning method, see IE Theorem 9.12-Remark 9.18] and Il2~2~l Theorem 4.1] . 



Theorem 3.3 There exists an i](d) > such that for fixed a and (3 satisfying \a — l/d\ < n and \/3 — l/d\ < rj it 
holds that 



E g ( M )[(Zg A 21 

E G(n,d) { Z G Pl \ 



lim " ywLV " ti ' J = T a >P(d) 



(the limit is taken over all n such that na, n/3 are integer), where 

T a,p^ _ (1-a-P- a(3) c 



(1 - a - (3 + 2a/3)( d " 1 )/ 2 (l -a- 0)^- x )l 2 sJl -a- (3 + da/Hy/l - a - (3 - (d - 2) a/3 



Theorem 13.31 is proven in Section [5] and Section [6j Very roughly speaking, the main step of the proof is 
showing that for values of a,/3 close to 1/d the major contribution to the second moments comes from pairs 
of independent sets of size (a, (3) that are "uncoupled". This means for example that the intersections of these 
sets are of sizes a 2 and j3 2 respectively. Moreover, for each of the d matchings defining the graph, the number 
of edges between the a size set in one copy and a (3 size set in other copy is of size a(3. Once this is established, 
using Gaussian integration one obtains Theorem [331 



Theorem 3.4 There exists an r](d) > such that for fixed a and (3 satisfying \a — l/d\ < rj and \(3 — l/d\ < rj it 
holds a.a.s. that that ^ 

(the inequality holds for values of n such that na, n[3 are integer). 



The proof of Theorem |3.4l in Section [7] uses the small graph conditioning method. This method requires in 
particular the exact value of T a,l3 (d). We now use Theorem 13 .41 to prove the main result, Theorem l2.2[ 



Proof of Theorem 12. 21 Since A > X c (d), by Lemma [3^21 we may let (a, (3) = (a,f3)(X) be the maximum point 
of <I>i in which a < f3. Since $i is continuous we may choose 5 > such that if 

\x-a\<S\,\y-P\<6 x,y:\x-y\<8 

then 

e := T - A > 0. 

By Markov inequality we have 

Y, Z*d V < ex P (n(A + J)) (2) 

x,y:\x-y\<8 



8 



(note that there are at most n 2 terms in the sum above) . Note furthermore that without loss of generality we 
may chose 5 small enough so that the two sets 

{{x,y) :\x-a\ <26\,\y-(3\ <25}, {{x,y) : \x - y\ <25} 

are disjoint and 5 < n/4 where 77 = 77(d) is used in the statement of Theorem 13 .41 

By Lemma [3721 (a,/3) is a continuous function of A. Therefore there exists an such that if A c (d) < A < 
\ c (d) + Ed then \a — l/d\ < n/4 and |/3 — l/d\ < rj/4. We may now chose (ai,/3i), . . . , (atk,(3k) all satisfying 
|aj — a\ < n/4 and — /3| < 77/4 such that for all n large enough we have that one of n(a\, Pi), . . . , n(ak,Pk) 
is an integer point. From Theorem 13 .41 it follows that a.a.s. 

3i,Z^>eMn(T- e -)) (3) 

and a similar statement holds for (/%, a») instead of («j, /%). By (EJ we have that a.a.s. 

/i(2|) < exp(n(A + -)) 

and by ([3]) we have that a.a.s. 

min{/i(lJ), M (2f)} >exp(n(r-|)). 

Therefore we conclude that a.a.s. 

/zE4] < exp(-^)min{/4jf],/44]} 

as needed. □ 



4 Logarithm of the first moment 



Here we prove the claims made regarding the expected total weight of independent sets with a given pair of 
densities. The proof below emphasizes the role of the hard core model on the tree. 



Proof of Proposition 13. It Notice that for a given subset I that occupies an and (3n vertices of V\ and V2, 
respectively the probability that I is an independent set of the chosen graph G is simply the probability that all 
the d chosen matchings do not connect the subset of size an (of V\) with the subset of size j3n (of V2), i.e., in 
each of the matchings all the an edges connected to the a subset fall outside the j3 subset. Summing all subsets 
of density (a, /3) gives: 



Eg[Zf] 



n 

an 



exp < n 



(in. 



\ an I 
) 



,n 
\an 



(a + p) ln(A) + H(a) + H(/3) + d 



a 



(l-/3)H( r — -)-H(a) 



exp($i(a, (3) ■ n). 



□ 
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Proof of Lemma [3T2t We start by analyzing the first and second derivatives of $1. We refer to Claim 2.2 and 
its proof in Q| to establish the following: 



Proposition 4.1 (i) The function &\ has no local maxima on the boundary of T and at least one local maximum 
in the interior of T. 



(ii) Any stationary point (a, 0) of <3?i satisfies (3 = f(a) and a = f{(3), where 

i/d~ 

1 



f(x) = (l-x) 

(iii) All local maxima of <E>i satisfy a + (3 + d(d — 2)a(3 < 1. 



A(l-x) 



(4) 



We now claim that the function / defined in §4§ also describes the relationship between the probabilities of 
occupancy on neighboring vertices in the infinite regular tree T d . Specifically let p be a simple semi-translation 
invariant Gibbs measure for the hard-core model with activity A on T d . For any vertex v on the even (respectively 
odd) partition of the tree let pe (respectively po) stand for p[v is occupied]. (The definition does not depend 
on the choice of v since p is semi-translation invariant.) 

We will show below that pe and po must satisfy the same relationship as in part (ii) of Proposition 14. 1[ 
i.e., it must be the case that pe = f(po) an d po = I(pe)- Indeed, recursive relationships similar to / for the 
probability of occupancy on the regular tree have been studied in the analysis of Gibbs measures on trees. See, 
e.g., |[8"l[T5ll . To see that pe = f(po) notice that by definition of the hard-core model, for any Gibbs measure p 
and every vertex v, p[v is occupied] = ■ p[al\ the neighbors of v are unoccupied]. On the other hand, if p is 
simple then 

p[a\\ the neighbors of v are unoccupied] = 

d 

p[v is occupied] + p[v is unoccupied] J^J p[ui is unoccupied | v is unoccupied], 

i=l 

where the itj are the neighbors of v. Thus, if we let pe = p[v is occupied | u is unoccupied], where p is the 
semi- translation invariant measure under consideration and v, u are two arbitrary vertices connected by an 

edge with v even and u odd then we get po = j^t\(po + (1 — Po)(l —PE) d ), i- e -> Pe = 1 — Po d /{H^ ~ Po)) ■ 
Plugging the latter expression for pe into the trivial equation pe = (1 — Po)Pe gives pe = f {po) as required. A 
similar derivation with the roles of even and odd vertices reversed gives po = /(pe)- 

Since f(x) is decreasing in x, it follows that for all A and d there is a unique p* G [0, 1] such that p* = f(p*), 
which means there is always a unique simple translation invariant Gibbs measure on the infinite tree. However, 
the analysis in |(8]1 (see also lfT5ll ) shows that for A < X c (d) = (d— l) d ^ 1 /(d — 2) d this measure is also the unique 
simple semi-translation invariant measure while for A > A c there are two more measures of the latter kind. This 
means that for A < A c , a = f3 = p* is the unique solution to the system of equations a = f(/3) and /3 = f(a) 
while for A > A c there exists p\ < p* < p2 such that (a = pi, (3 = p%) and (a = p2,/3 = p\) are (the only) two 
additional solutions. It is also easy to verify that p* = 1/d at the critical activity A c . Notice that / is increasing 
in A and therefore so must be p*, i.e., for A > A c , p* > 1/d. By part (iii) of Proposition 14.11 we get that for 
A > A c the point a = j3 = p*, although stationary for $i, is not a local maximum. We conclude that for A > X c 
the maximum of $1 over the triangle T is achieved at (a = pi, /3 = P2) and (a = P2, (3 = pi). □ 



5 Logarithm of the second moment 

We would like to show concentration of Zq , at least for some a, f3, by using the second moment. We begin 
with the function describing the exponent. We introduce the overlap parameters 7 and 5 as follows. We 
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calculate the contribution of pairs of independent sets such that both independent sets have an vertices on 
the right and j3n vertices on the left, and the overlap on the right is 771 and the overlap on the left is 8n. The 
constant in the exponent of this sum is then: 

* 2 (a,/9,7,i) = 2(a + /?) ln(A) + 

H(a) + aH(^) + (l- a)H(?—t) + 
a 1 — a 

H(f3) + (3H(^) + (l-(3)H(^) + 

where ^2 is the logarithm of the probability that two independent sets of the structure with parameters 
(q, (3, 7, 5) remain independent when a random matching is added. This is given by: 

* 2 (a,ft7,M = (l-2g + ^)g( i _2 ) + |j )-g(7) + 

p - 2 ^ + 5 - ^ 1-2/^-, ' + w - w 5 ^) - (i - OTi5?i + 

(l-/»-7-«)g( IZ f5^ )-(l-«)g(gfg), 

where the three lines correspond to the probability of the following three events respectively. The first is that 
the jn edges connected to the intersection on the right avoid both sets on the left. The second is, conditioned 
on the first occurring, that the (a — j)n edges connected the first set on the right but not to the second avoid 
the first set on the left, where we sum over the number en of edges that avoid both sets (and therefore take the 
maximum over e). The last event is, conditioned on the first two occurring, that the (a — -y)n edges connected 
to the second set on the right but not to the first avoid the second set on the left. 

We would like to show that <£ 2 < 2<&i given in Proposition 13 . 1 1 for all relevant a, (3 and all 7, 8, e (where the 
latter quantities have to make sense, e.g., 7 < a, etc.), i.e., we want to show that: 



!>,/?, 7, <5,e) := H{a) + H((3) 



aH(l) + (1 - a)Hg—l) + (3Hd) + (1 - 0)H( P 



+ 



a 1 — a (3 1 — j3 

d ■ [2*! (Q )( 3)-$ 2 (a,j3,7, 8, e)] > . (5) 



Lemma 5.1 For all a, (3 the point (a, (3, 7* ,5* ,s*) is a stationary point ofboth <£ 2 and^ 2 , wherej* = a 2 ,5* = (3 2 
and e* = a(l — a — (3). 



Lemma 5.2 For all a, (3 the point (a, /?, 7*, 5*, e*) satisfies F(a, f3, 7*, 5*, e*) = 0. 



The meaning of the above two (easily verifiable) lemmas is that in order to show that T(a, j3, 7, 5, e) > for 
given a, [3 and all 7, 5, e we have to show that the stationary point (7*, 8*, e*) is in fact the global maximum of 
$2- While we believe this to be true for all relevant a, f3, we carried out the detailed calculations only for a, f3 
close to the critical values. 



6 Calculation of the 2nd moment around the critical point for d > 3 

Here we show that the second moment is tight when d > 3 and a = (3 = l/d. 
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6.1 The logarithm of second moment 



Recall that 

H(x) := —x log(x) — (1 — x) log(l — x). 

Let: 

Hi(x,y) := -x(log(x) - log(y)) + (x - j/)(log(y - x) - log(y)). 

Then we have: 

* 2 (a, /3, 7, 5, e) = #1(7, (1 - 2 * (3 + 8)) - H(y) + H^e, (1 - 2 * p + 6 - 7)) + 
Fa ((a - 7 - e ), (0 - J)) - if a ((a - 7), (1 - 7)) + 
Hi((a - 7), (1 - P ~ 7 - e)) - #i((a - 7), (1 - a)) : 

Therefore the important part of the log of the second moment is given by: 

f(a, (3, j,8,e) = 2 * (a + 0) * ln(A) + H(a) + #1(7, a) + 

Fi((a - 7), (1 - «)) + #00) + HtiS, P)+H 1 (p-6,l-P)+d* * 2 (a, ft 7, 8 e). 

We now find all stationary points in the region defined by all variables being nonnegative as well as follow- 
ing constraints: 

a-7-e>0, P-8>0, 1 - 2/3 + S - 7 - e > 0. (6) 

Lemma 6.1 Let a = P = 1/d. Then the function f as a function 0/(7,5,5) obtains its maximum the interior of 
the region ©. 

Proof: We know that for a,P, the independent point (a,P, 7*,<5*,e*) 3 where 7* = a 2 , 5* = P 2 and e* = 
qj(1 — a — p) is a stationary point of / and we would like to show that it is the global maximum for a = p = 1/d. 



Recall that 



and 



dHi{x,y) = y -x 
dx x 

dHt(x,y) , y 

- log(- 



dy y — x ' 



so the derivatives of / are: 

df\ (1 - 2P + 6 - 7 - e) d (q - 7 - e) d (1 - 2a + 7) d_1 



exp 



dj J 


(1 




(dj\ 


exp 


\dSJ 


df\ 


_ (1 


de j 





(1 - P - 7 - e) a (P - 5 - (a - 7 - e)) a (a -7) 2 7 



(/3 - J - (« - 7 ~ e)) d (1 - 2/3 + 
(1 - 2/3 + 5 - 7 - e) d (/? - 8) d ' 2 8 

(1 - 2/3 + 8 - 7 - e) d (a - 7 - e) d (1 - a - /3 - e) d 
(1 - /3 - 7 - e) d (/3 - 5 - (a - 7 - e)) d e rf ' 

For the proof of the lemma note that the derivatives (at least w.r.t. to one of the variables) go to infinity (+ 
or -, in the right direction) as we approach any boundary point of the region defined by ([6]) . 

The first derivative goes to 00 as 7 — > 0, the second as 8 — > and the third as e — > 0. Similarly, the second 
derivative goes to —00 as 8 — ► p the first ase + 7 — 5^1 — 2/3, and the third as 7 + e — > a. This implies that 
the global maximum must be obtained at an interior point. □ 
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In order to proceed we need the second derivatives of /. 



Lemma 6.2 

df_ 

<9 2 7 



85 dj 

df 
dedj 

d 2 5 

df 
dedd 
df_ 
d 2 e 



d 



d 



+ 



d-l 



d 



+ 



d 



+ 



d-2 1 



1 — 2(3 + 5 — 7 — e a — 7 — e 1 — 2a + 7 (3 — 5 — (a — 7 — e) 1 — (3 — 7 — e a — 7 7 
d d 



+ 



l-2/? + <5-7-e /3-5-(a-7-e) 
d d 



d 



+ 



d 



1 — 2(3 + 5 — 7 — £ a — 7 — £ (3 — 5 — (a — 7 — e) 1 — /3 — 7 — e 



+ 



d-l 



+ 



d-2 1 



/3-<5-(a-7-e) l-2j9 + <5 l-2/? + «5-7-e /?-5 5 
d d 



+ 



13-5 - (a - 7 - e) 1-2/7 + £- 7- 6 

(f d d 



+ 



d 



d 



d 



1 — 2/3 + 5 — 7 — e a — 7 — e 1 — a — /? — e 1 — (3 — 7 — e /3 — (5 — (a — 7 — e) e 



Proof: 



Next we find for fixed 5 and 7 the e which maximizes /. 



Lemma 6.3 For fixed values of a, 0, 5 and 7 the maximum of f is obtained for 

1 



e(7,S) = e(a,p, 7 ,S) = - [l + a - /3 - 2 7 - V(l - a - /?) 2 + 4(a - 7 )(/? - <$) 



Proof: Solving for e in the equation jfe = gives a quadratic equation whose unique solution in the legal 

Note that e is a 



range of e is: £(7, 5) = i(a, f3, 7, 6) = \ 1 + a - (3 - 27 - ^(1 - a - (3) 2 + 4(a - 7 )(/3 - 5) 

We 



maximizer since Mf- < throughout the region © . □ 



We now define the function 



9(7, S) = g(a, (3, 7, 5) = f(j, 5, £(7, 5)). 



The proof will proceed by showing that the characteristic polynomial of the Hessian matrix of g has only 
negative roots throughout the region defined by 



< 7 < a, 0<5< (3. 



(7) 



Lemma 6.4 The derivatives of g are: 



The second derivatives are: 



dg ( 
d 2 j 
dg 
85 d 7 
dg 
8 2 5 



(7,<5, e(7,<5)) 



dg , X s df 



df , ^ <9e 3/ . > 

dV''' e) + 57" d^d£~ { ' , ' ,£) 

df , - di df , 



(v) 



d5dj 
df 



O7 cMcte 



d 2 6 { ' r,£) 88 ' 85d£ [ ' , ' ,£) ' 



13 



where 



de , (3-6 

-1 + 



dl ^/{l - a - (3) 2 + A{a - ^{(3 - 5) 

de a — 7 

95 ~ ^(l _ a - ff)2 + 4 (a - 7 )(/3 

Proof: The calculation are straightforward, noting that ||(-, •, e) = 0. □ 

The major technical challenge is to prove that: 

Lemma 6.5 Let d > 3 and a = (3 = 1/d. Then the function f has a unique stationary point in the interior of the 
region ©. This point is 

1=6=^, e = a (l- 3 ) 

and it is the maximum of the function. 

Lemma [631 is proven in Lemma [6761 and Lemma [6771 below. 

Lemma 6.6 Let d > 4 and a = (3 = 1/d. Then the function f has a unique stationary point in the interior of the 
region ©. This point is. 

7 =<5 = cp, e - 3 (1- 3 J 

and it is the maximum of the function. 

Proof: Given Lemma [67T1 it suffices to show there is a unique local maximum of / in the interior of ©. Given 
Lemma IB731 it suffices to show that that g has a unique local maximum in the region ©. Using Lemma [6741 for 
a = (3 = 1/d we obtain: 

d 2 8 ffi K '''' e} ^~ as' d6di K '''' ' 

d d-1 d d-2 1 

- + ~, ttttt -g - ^ ; ; + 



-«5 + 7 + f l-2/d + <5 1-2/d + d-j-i l/d-6 5 
d d \ 1/d -7 



5 + 7 + s 1 - 2/d + 6 - 7 - e / ^(i - 2/d) 2 + 4(l/d - 7 )(l/d - 6) 
d d-1 d d-2 1 

+ Z z-n r - -n ; r + 



-8 + j + i 1-2/d + d l-2/d + S-i-e 1/d- 5 5 
d d \ 1/d -7 



-5 + j + e l-2/d + <5-7-£y l-2 7 -2e' 
Note that for d > 4 we have 

1/d -7 < l/d- 7 < 1 < 2 



1 - 27 - 2e - 1 - 2/d - d - 2 - d' 
Therefore 

<9 5 d-2 d-1 d-2 d-2 1 

W5 ~ ~ (3-5- (a-7-e) + 1-2/3 + 5 ~ 1-2/3 + 5-7-e + /? - 5 ~ 5 

We now claim that the last expression is negative throughout the region ([7]). This follows since (3 — 5 > 

(3 - 6 - (a - 7 - e), 1-2(3 + 8 > 1-2/3 + 8-7 - i and 1 - 2(3 + 5 > 5. We therefore conclude that for 
a = (3 = 1/d the square derivative < throughout the region (0) . 
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Since g is symmetric in 7 and 5 for a = (3 we conclude that J^- < throught the region as well. 
We now note that = and therefore we get that 

9 9 . g£ . _df_, . £) roi 
^7 7 dedS K,,) lj 

v / (l-a-/3) 2 + 4(a- 7 )(/3-<5) ' SeSJ l '' £) 

since > 0. Thus, since the square derivatives are negative, for any point (7, §), the 2nd derivative of g 
along the line connecting the point (7, 5) with its mirror (5, 7) is negative. (Notice that this 2nd derivative is 
the sum of the square derivatives in each variable minus twice the cross derivative.) Since g is symmetric for 
a = (3 = 1/d, we conclude that the maximum of g must be obtained on the line 7 = 5. 

What remains to be shown is thus that the 2nd derivative of g along the line 7 = 5 is negative. We proceed 
with a and (3 set equal to 1/d. 

This derivative equals to the sum of the square derivatives plus twice the cross derivative. Since g is 
symmetric and we are considering the line 7 = 5, it is enough to show that 

for every < 5 < f3 = 1/d. We use © and ((9]), express the square root using the formula in the statement of 
Lemma [631 and note that in the present situation (where 5 = 7) 

df df , , de de 1/d- 5 

and also 1 + 



ded5 dedj' <9 7 85 1 - 25 - 2e 

using Lemma [6~4l Thus, evaluating at e = e and 7 = 5, 

W5^ 6) + d^^ 5) = W + 2 d^5-d5- 

We find after some algebra that 

d 2 f -(d-2)P(d,5,e) 



d5 2 {d - 2 - ed)e{d - 2 + d5){l - d5)5' 

d 2 f de 2(d-2)(l-d5) 



(10) 



where 



ded5 85 i(d - 2 - id)(l - 26 - 2e) 

P(d, 5, e) = -5d 3 e - d 3 5 2 - d 3 5 3 + 5d 3 e 2 + 3d 2 5 2 + 2d 2 5e + 5d 2 - e 2 d + id - 2d5 - 2e 
= d(5d 2 - l)i 2 - (d - 2)(5d 2 - l)e + d5(l -d5){d-2 + dS). 

We may omit the common factors (d — 2)/(£(d — 2 — id)). Note that, according to ©, 5 + i < a = 1/d, and 
hence we see that the expression in (TlOl) is positive and may be bounded above using (1 — 2/d) in place of 
(1 — 25 — 2i). Thus, it is enough to show 

P(d,5,£)(l - 2/d) - 25(d - 2 + d&)(l - d5) 2 > 0. (11) 

Note that the formula for e gives £ = |(a + b — Va 2 + b 2 ) where a = 1 — 2/d and b = 2/d — 25. Realizing the 
numerator gives 

A _ \ (a + b + Va 2 + b 2 )(a + b - Va 2 + b 2 ) _ ab _ 2(1 - 2/d)(l/d - 5) 

a + b + Va 2 + b 2 a + b + Va 2 + b 2 a + b + Va 2 + b 2 
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Since both a and b are nonnegative, the denominator is at most 2(a + b) = 2(1 — 25), and so 

_ (l-2/d)(l/d-fl 

£ > £min - x _ 2J 

For an upper bound we will use, as from ©, e < 1/d — 5. 
Now consider the parts of P(d, 5, i) involving £: 

(l-5d 2 )({d-2)£-d£ 2 ). 

The second factor is monotonically increasing in e and positive up to 1/d (as d > 4). So for 5 < 1/d 2 , when the 
first factor is positive, P is bounded below by substituting £ = e m i n . Similarly, for 5 > 1/d 2 , we obtain a lower 
bound by substituting £ = 1/d — 5. 

Substituting £ = 1/d — 5 into the left hand side of (fTTT) gives 

(1 - d5){2d 4 5 3 + Qi(d, 5)) 
d 2 

where 

Qi{d, 5) = 2d 3 (d - 3)5 2 - d(d - l)(d - 2)5 + {d- 2){d - 3). 
This quadratic in 5 has its minimum at 5 = (d — l)(d — 2)/4d 2 (d — 3), where its value is 

which is easily seen to be positive for d > 4. Thus for d > 4, we have (fTTl) and are done in the case 5 > 1/d 2 . 

Next consider 5 < 1/d 2 . Substituting £ = £ m i n into the left hand side of (fTTT) gives 

(l-d5)Q 2 (d,5) 
d 4 (l - 2<5) 2 

where 

Q 2 (d, 5) = 8d 6 5 5 + M 5 (d - 8)5 4 + d 4 {-6d 2 + 8d + 32)5 3 + d 3 (10d 2 - 20d - 8)5 2 
+d(d - 2)(d 3 - 9d 2 + 8d- 4)5 + {d-l)(d- 2) 2 . 

Rewrite this as 

Q 2 (d, 5) = 8d 6 5 5 + (4d 5 (d - 8)5 4 + d 4 (8d + 32)5 3 ) + d 3 (Wd 2 - 20d - 8 - 6d 3 5 - 3d 2 + 6d)5 2 
+ (3d 4 (d - 2)5 2 + d(d - 2)(d 3 - 9d 2 + 8d- 4)5 + (d - l){d - 2) 2 ). 

We argue that each of the four terms here is nonnegative (and the last is strictly positive) for 5 < 1/d 2 . The 
first is immediate. The second is clearly nonnegative for d > 8; otherwise the first part is minimized by 5 = 1/d 2 
which makes the whole term nonnegative. For the third term, the big factor is at least 7d 2 — 20d — 8 > as 
d > 4. The last and longest term is quadratic in 5. Its derivative with respect to 5 is, using the upper bound 
1/d 2 for 5, easily seen to be negative for all d < 7. So for such d we may substitute 5 = 1/d 2 , and noting 

d 3 -9d 2 + 8d-4 = d(d-8)(d-l)-4 (12) 

this whole term becomes 

3(d - 2) + (d - l)(d - 2)(d - 4)(d + 1) - 4(d - 2)/d > 0. 

On the other hand, to cover the case d > 8 for the last term, ignoring the (clearly nonnegative) 5 2 term gives a 
linear function with positive constant term, so it is positive provided 

did - 2)(d 3 - 9d 2 + 8d - 4)4 + (d - l)(d - 2) 2 > 0. 

d l 



This is obvious for d > 8, using (fl~2l) . 

This completes the proof that for d > 4 and a = j3 = 1/d the function / has a unique stationary point which 
is the global maximum of /. □ 
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Lemma 6.7 Let d = 3 and a = (3 = 1/3 
region ©. This point is. 

and it is the maximum of the function. 

Some of the algebra the proof of the lemma below was performed using MAPLE. In particular, MAPLE was 
used in order to symbolically factor polynomials and calculate resultants. 

Definition 6.8 Letx = (xi, . . . ,Xk) be avector of variables. Letp(x,y) = Y^iLoPii^y 1 an dq(x,y) = YliLoQi( x )y l 
be two polynomials of degree d\ and d2 in the variable y. The resultant of p and q with respect to y, R(p, q; y) is 
the polynomial of x defined by the determinant of the Sylvester matrix of the two polynomials. 

The Sylvester matrix of p and q is the (d\ + c^) x (d± + c^) matrix defined by: 







PO 















Pd 1 




Po 



































Pdi 




Po 
















Pd 1 




Po 


qd 2 




qo 
















qd 2 




qo 



































qd 2 




qo 





Vo 










qd 2 




qo) 



. Then the function f has a unique stationary point in the interior of the 

7 =5 =e =- 



In particular, in the proof of the lemma, we will often use the following well known result: 
Fact 6.9 If (x, y) is a root of both p and q then x is a root of their resultant R(p, q; y). 



Proof: Given Lemma [6TT1 it suffices to show there is a unique local maximum of / in the interior of ©. 
We take the derivatives of / to obtain: 

df\ (e- l + 2/?-<5 + 7 ) 3 (-a + 7 + e) 3 (l-2a + 7 ) 2 



ex P ~r 



djj (p-S- a + j + ef(j-a){-l + p + j + e) 3 7 
df\ {2/3-5- l) 2 (a- 7 -e-/3 + 5) 3 



exp 



dSJ ( e - 1 + 2/3-5 + 7) 3 (0-5)6 



fdf\ = (e - l + 2/3-^ + 7 ) 3 (q-7-e) 3 (a - l + /3 + e) 3 
eXP \de) ~ e 3 ( a - ry _ e _ [3 + §f (_i + p + 7 + e ) 3 

In particular, in order for (1/3, 1/3, 7, 5, e) to be a stationary point, we must have equality between the numera- 
tor and the denominator in the expressions above. In other words, if (1/3, 1/3, 7, 5, e) is a stationary point then 
it is a zero of the following three polynomials 



G := -(e-l + 2/3-5 + 7 ) 3 (a-7-e) 3 (2 a - 7 - l) 2 - 7 (a - 7 - e - /? + 5) 3 (a - 7) (-1 + /3 + 7 + e) 
D := (2f3 -5 -if (a-j -e- P + 6) 3 -{e-l + 2p -5 + jf(f3-5)5 
E := -5a + (^7 + 7 - 2/?7 - 7 2 + e - f3e - 2<ye - e 2 - a + 2 (3a + 07 + ae, 



where, for the last equation, we used the fact that ar = implies x = y for real x and y. 
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When we substitute a = (3 = 1/3 into these equations we get: 

G = - (e - 1/3 - 5 + 7 ) 3 (1/3 - 7 - e) 3 (-1/3 - 7 ) 2 - 7 (- 7 - e + 5) 3 (1/3 - 7) (-2/3 + 7 + e) 3 
D = (-l/3-<5) 2 (- 7 -e + <5) 3 -(e-l/3-<5 + 7) 3 (l/3-<5) ( 5 
£ = _i/35 + £ 7 + 2/37-7 2 + e-27e-e 2 -l/9. 

In order to proceed, we eliminate variables using Fact 16.91 and calculating resultants. We let R(G, E) = 
R(G, E; 5) and R(D, E) = R(D, E; 5). Then 

R(G,E) = — — (81 7 2 e 2 + 81 7 e 3 - 27 7 2 e - 27 7 e 2 + 9e 3 + 3 7 2 + 9 7 e - 7) (3 7 + 3e - l) 3 (3 7 + 3e - 2) 3 
59U49 

and 

R(D, E) = — (3 7 + 3 e - 1) (243 7 4 e 2 + 972 7 3 e 3 + 1458 7 2 e 4 + 972 7 e 5 + 243 e 6 - 81 7 4 e - 648 7 s e 2 - 1566 7V 
729 

- 1512 7 e 4 - 513 e 5 + 9 7 4 + 108 7 3 e + 486 7V + 756 7c 3 + 369 e 4 - 6 7 3 - 45 7 2 e - 132 7 e 2 

.3 , 2 



-105e 3 + 7 2 + 67e + 9e 2 ). 



Note e + 7 = 2/3 is impossible since e + 7 < a in © . 

We can eliminate £ + 7= 1/3 as follows. Substituting e = 1/3 — 7 into equation G gives 

' :7 (3«5-l) 3 (-l + 3 7 ) 



2187 

and the only zeros of this are at 5 or 7 = 1/3, which are on the boundary. 
It thus suffices to consider zeros of 

R(G,E) := 8l7 2 e 2 + 8l7e 3 - 277 2 e-277e 2 + 9e 3 + 37 2 + 97e-7 

and 

R(D, E) := 243 7V + 972 7 3 e 3 + 1458 7V + 972 7 e 5 + 243 e 6 - 81 7 4 e - 648 7V 
- 1566 7 2 e 3 - 1512 7 e 4 - 513 e 5 + 9 7 4 + 108 7 3 e + 486 7V + 756 7e 3 + 369 e 4 - 6 7 s - 45 7 2 e 

- 132 7 e 2 - 105 e 3 + 7 2 + 6 7e + 9 e 2 . 

We let R = R(R(G, E; 5),R(D, E; 5); 7), and find that 

R = -243 (9e-l)(27e 2 -9e + l) (81 e 4 - 81 e 3 - 27 e 2 + 12 e - l) 
(1296 e 4 - 1917 e 3 + 840 e 2 - 97 e + 6) (3 e - l) 2 e 4 . 

The zero e = 1/9 will be investigated below. The case e = 1/3 is on the boundary (it forces 7 = by ©) so 
is not of interest at present. Similarly e = is on the boundary. The other factors are 

27e 2 -9e + l 

81e 4 -81e 3 -27e 2 + 12e-l 

1296 e 4 - 1917 e 3 + 840 e 2 - 97 e + 6 

These polynomials have no roots in the required range as can be verified by elementary calculus. In the case of 
the two quartics, the plots in Figure [1] might help convince the reader of this. 
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0.05 0.1 0.15 0.2 0.25 0.3 



Figure 1 : The two quartic polynomials 



So now we consider the only remaining case, e = 1/9. Substituting e = 1/9 in D and G we obtain 

D := (-1/3 - 5) 2 (-7 - 1/9 + 5) 3 - (-2/9 - 5 + 7 ) 3 (1/3 -5)5 

and 

G := -(-2/9 + 5 + 7) 3 (2/9 - 7 ) 3 (-l/3 - l) 2 - 7 (~7 " 1/9 + «5)(l/3 - 5 )(-5/9 + 7 ) 3 - (13) 
Letting R = R(D, G, 7 ) we find that i? factors as 

1 

984770902183611232881 

where 



(9<5-l) (1458 5 J + 405 5 2 + 24 5 + l) (35- l) d P(5) 



= 1549681956 5 n + 2970223749 5 10 - 157837977 5 9 - 36669429 5 s + 42830208 5 7 
-35446896 5 6 - 4331961 5 5 + 1160487 5 A + 227345 s + 47529 5 2 + 12720 5 + 64. 

We will consider the case 5 = 1/9 later. Note that 5 = 1/3 is on boundary. The other factors are: 

14585 3 +4055 2 + 245 + 1 

which is clearly positive for 5 > 0, and P(5) which is positive as shown by the plots for different intervals in 
Figure 2. 

More formally, this can be verified as follows. We first observe that the polynomial 

-1578379775 s - 36669429 5 7 - 35446896 5 5 - 4331961 5 4 + 12720, 

is decreasing and positive in the interval < 5 < 0.18. So P(5) is positive for all such 5. Then expanding 
P(5 + 0.18) we obtain that 

P(5 + 0.18) > 3000 + 90005 - 3 * 10 5 5 2 - 1.5 * 10 6 5 3 

for 5 > (all other monomials have positive coefficients) . It is easy to verify that the polynomial on the right 
hand side is positive for 5 € [0,0.09]. Finally, looking at the polynomial P(5 + 0.27) we see that all of its 
coefficients are positive. 
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Figure 2: The degree 11 polynomial P(5) 



The conclusion is that any interior stationary point must satisfy e = 1/9, S = 1/9. Substituting these 
into (fl~3l) gives 

(-1 + 3 7 ) (9 7 - 1) (6561 7 5 - 5832 7 4 + 1377 7 3 + 18 7 2 - 36 7 + 8) . 

Zeros are at the boundary (7 = 1/3) or the crucial value 7 = 1/9. The large factor has no zeros in the relevant 
range. This follows since both polynomials 6561 7 2 — 5832 7 + 1377 and I87 2 — 367 + 8 are positive for all 
7 6 [0, 1 /4] . Moreover for all 7 it holds that 6561 7 2 - 5832 7 + 1377 > 80 and 8O7 3 + 18 7 2 - 36 7 + 8 is positive 
in the interval [0.25, 1/3]. (See Figure^) 




0.05 0.1 0.15 0.2 0.25 0.3 



Figure 3 : The quintic polynomial 

In order to conclude we have to show that the point 7 = J = e = l/9isin fact a local maximum. For this 
we calculate the Hessian matrix of the function / at that point to obtain: 

■ _243 81 _243 " 
4 2 4 

81 _81 81 
2 2 2" 

243 81 405 
4 2 4 . 
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The characteristic polynomial of this Hessian matrix is 

o 405 2 45927 531441 

x H x H x H 

2 8 16 

and all roots are less than zero, so we have a local maximum here. This concludes the proof of Lemma [631 

□ 

We would now like to conclude that for a and (3 close to 1/d it holds that there is a unique maximum at the 
stationary point (a,/3,j*,S*,e*). By concavity one obtains 

Lemma 6.10 There exists v > such that if \a — l/d\ < v and \(3 — l/d\ < v then the function g(j,5,e) = 
f(a, (3, 7, 5, e) has a unique stationary point in the interior of and this point is its global maximum. 

Proof: First note that since the function / is continuous it follows that for sufficiently small v the maximum of 
the function g cannot be obtained on the boundary of the region. Therefore g has at least one local maximum. 

From the continuity of the derivatives of / it follows that for v sufficiently small, all stationary points of g 
have to be 77-close to the point (l/d 2 ,l/d 2 ,l/d(l — 2/d)). Moreover v may be chosen such that g is concave 
down on the n neighborhood of (1/d 2 , 1/d 2 , 1(1 — 2/d)). However, this implies that g has a unique stationary 
point and it is a maximum. The proof follows. □ 



6.2 The ratio of second to first moment 



So far we have only dealt with the logarithms of the first and second moments. In order to apply the second 
moment method we need to consider the ratio between the moments more precisely. Using the quadratic 
behavior of the function / around the stationary point we obtain: 

Theorem 6.11 Let d > 3. Then there exists rj > such that if \a — l/d\ < r\, |/3 — l/d\ < rj, the number n is 
sufficiently large and an, fin are integers then 



E g(M) [(^) 2 ] 



1 



E 



Z. 



(l + o(l))-- 



1 



G{n,d) L G 



2tt a(l-a)P(l-P) 



00 ;>oo 



d'ydd 



00 j —00 



'(!-«)(!-/?) 



' —00 
„2 x* Pii 



de exp \ — (7, 8, e)H f (7, 5, e) 



y / 2~7r 1 — a — (3 V a(3 
where Hf is the Hessian of the function f at the point a, (3, 7* = a 2 , 5* = (3 2 ,e* = a(l — a — (3). 

Proof of Theorem (6TTT} We use the approximation 



n 
an 



:i+o(i)) 



\/2im y/a(l — a) 



. p H{a)n 



and thus 



We have: 



bn\ , , ,. 1 / b 

= (1 + 0(1))-=../— 

an) V2in V a{b - a) 



, bH{a/b)n 



Eg[(Z^)} 



(' „ 

\an J \f3n 



((l-(3)n\ 
\ an I 

(") 

\ctnl 



(14) 
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and 



Eg[(Z^) 2 } 



n \ In 



E 



an\ f (1 — a)n\ ( /3n\ /(l — /3)n 



lan / \ (5n J ' V 771. / V (a — 7)77. 7 \Sn J \(0— 5)n 

7,(5 

f(l-2/3+<S)ra\ /(l-2/3+5-7)n\ / (/9-<5)n \ /(l-/3- 7 - £ )n\ 

I 771 / V £71 / V(o— 7— e)ra/ V (a— 7)n / 

\7n/ e 



Thus, 



E g [(^) 2 ] 



n \ In 



/(l-7)n\ 
\(a— 7)71/ 



f(l-a)n\ 
\(a— 7)n/ 



c(l-/3)n\ 
v an / 



-2d 



an / \/3n 

^ an^ ^(l — a)n\ ( pn\ f (1 — /3)n 



7,5 

/(l-2/3+5)n\ 
\ 7n / 



7ny \(a — 7)n/ \5n J \(/3 — 8)n 

E 



■yn 
V771/ 



(l-W- 7 )nW (,3-<5)n \ /(l-/3- 7 -e)m 
en / V(o— 7— e)n/ \ (a— 7 )n / 



/(l- 7 )n\ 
\(a— 7 )n/ 



/(l-a)n\ 
V(o— 7 )n/ 



Using the above approximations, we get 



E g [(^) 2 ] 



(l + o(l)) 



E 

7,5 



V2 



7rn 



d+2 



a(l - a)/3(l - (3) 



l-a-p 
(1 -<*)(!-/?) 



1 



(a - 7 )09 - <J) y 7<5(1 - 2a + 7 )(1 - 2/3 + 5) 

e [-fr(a)-HC9)+aH(5)+(l-a)H(fE2)+/JH(|)+(l-/9)H(?E|)]n 

[V (1 - 2/3 + J)(l - 2a + 7 )(/3 - 5)(a - 7) 

E 



(1 - P - 7 - e) 



e(l - 2^ + 5 - 7 - e)(a - 7 - e)(/3 - 5 - a + 7 + e)(l - /3 - a - e) 

d 



,[* 2 (a,/3,7,'5,e)-2* 1 (a,/3)]rj 



(1+0(1)) 



E 

7,5 



V2 



7rn 



d+2 



a(l - a)/3(l - f3) 



1-a-p 

(MM) 



1 



1 



(a - 7 )(p-S)]j 75(1 _ 2a + 7 )(1 - 2/3 + 5) 
[V(l - 2/3 + <J)(1 - 2a + 7)(/? - 5)(a - 7) 

E 



(1-/3-7 -e) 



y e(l - 2/3 + 5 - 7 - e)(a - 7 - e)(/3 - 5 - a + 7 + e)(l - P - a - e) 

r(a,/3, 7 ,<5,e) 

e <* 



For d > 3 and a,/3 close to 1/d, it follows from Lemma [6.101 that the function T(a, P, 7, <5, e) has a unique 
maximum and that the function decays quadratically around this point. This implies that all terms in the sum 
above that have 7, 5 or e more than $7(1) away from the maximal value have an exponentially low contribution. 
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Thus, up to losing a factor of 1 + o(l), we can plug in the above values of 7, 5, e except into the exponential 
terms. This gives: 



(l + o(l)) 



1 



2irn a(l-a)P(l-P) 



E 

7,5 



1 



V2 



nn 



1-a-P 



(1 — a)(l — 0) ^ EMtM„ 



Finally we use the quadratic approximation of T(a, /3, 7, 5, e) around a,P, r y*(a,P),5*(a,P),e*(a,P) and the 
standard approximation of integral by sums to arrive at an integral formula. For this we recall that 

T(a,P,j*,S*,e*) = 

and we note that the Hessian of T equals the Hessian of / as T(a, (3, 7, S, e) = f(a, 0, 7, 5, e) + 2$i (a, 0) and $1 
does not depend on 7, <5, e. We thus obtain (fT4b . □ 



We are in a position to give the proof of Theorem 13.31 



Proof of Theorem |3.3t In order to prove the theorem, we need to calculate the expression in ( fT4l ). Using the 
derivatives calculated in Lemma [6721 we have: 



where 



/in 

fl23 
^33 



a + d-2 j3 + da 



+ 



/ill 


hi2 


hi 3 \ 




ft 12 


h 2 2 


h 2 3 




7*13 


h-23 


w 






d 






(1- 


a)(l 







d 

0(1 - 0)(1 - a - 0) 

d(l-a-2p + 2aP + p 2 ) 
~a0(l - a)(l - -a-P) 

1 — a — P + daP 
~ 02(1 -0)2(1- a- 0) 
d 



We let 



/?(l-/?)(l-a-/3) 

d(l - a - P + 2ap) 
~ aP(l - a)(l - 0)(l-a- 0)' 



5(7,5, e) = — (7, S,e)H f (j, <5,e)* 



and observe that this is quadratic in e. Then putting 



fe( 7 ,a) 



1 



1 



aP 



^/2vf 1- a- P 
as required, Gaussian integration gives 

fc(7,<5) = (^ d exp(- J B (i ) 

where 

(l-a)(l-/3) 



(i-q)(i-fl r d£eg(7 ,, £ ) 



- a - /? + 2a/3)(l - a - 0) 
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and Bd is much more complicated and is a quadratic polynomial in 7 and in 5. In fact 



where 

A = 

B = 
C = 



B 2 - AAC 
Bd= 4A 



2(3a - (3 + 1 - a 
2(3a{\ - (3-a + (3a){(3 + a - 1) ' 
-cry + 2/?7« + 7 - 2(3^ + (3 2 j + 5a 2 - 5a 
f3a{-a + (3a + 1 - 2/3 + (3 2 ){a - 1) ' 
7 2 (2d/3a 3 - da 3 + 2d(3 2 a 2 - 5da 2 {3 + 2da 2 + d(3 3 a - 3dp 2 a - f3 2 a + M[3a + (3a - da - (3 3 + 2(3 2 - (3) 



2da 2 {(3 - 1)(-1 + (3 + a){a - 1) 2 (3 

6j (jd(3 2 - yd/3 + d(3a5 - 5a + 5 - 5(3)5 

+ 2(3{-a + (3a + I - 2(3 + (3 2 ) + 2d(3 2 {-a + (3a + 1 - 2(3 + (3 2 )(f3 - 1) ' 



Integrating again, we obtain 

1 1 



2?r a(l - a)(3(l - fi) 

as required. □ 



00 coo 



djd6h(i,5) = r a 'P(d) 



00 j —00 



7 Asymptotically almost sure results 

In this section we prove Theorem |3.4l using the small graph conditioning method. 

The small subgraph conditioning method has some chance of applying to a random variable Y when the 
variance Var(y) is of the same order as (El") 2 . This is indeed the case for Zq 13 for a and (3 such that the 
conclusion of Theorem l3.3l holds. In random regular graphs the only interesting local structures that occur with 
nonvanishing probability are short cycles. The method usually "explains" the variance of Y by the interaction 
between the numbers of short cycles and the random variable Y. For details, see Q Theorem 9.12-Remark 
9.18] and Il22l Theorem 4.1, Corollary 4.2]. The following is a simplification of the latter. Here [x] m = 
x(x — 1) • • • (x — m + 1) denotes the falling factorial and a.a.s. denotes "asymptotically almost surely," i.e. the 
probability tends to 1 as n — > 00. 

Theorem 7.1 Let Aj > and 5 > —1 be real numbers for i = 1,2, Let oj(n) —> and suppose that for 

each n there are random variables X{ = Xi(n), i = 1,2,... and Y = Y(n), all defined on the same probability 
space Q = Q n such that Xi is nonnegative integer valued, Y is nonnegative and E Y > (for n sufficiently large). 
Suppose furthermore that 

(i) For each k > 1, the variables Xi,...,X^ are asymptotically independent Poisson random variables with 

YiXj — > A,-, 



(ii) for every finite sequence mi, ... , rrik of nonnegative integers, 

I: 

■ [Xk\m k ) _^ ■ 

EY 



E{Y[X 1 U ---[X k ] mk ) ^ (A . (1 + k))mi (15) 

(Hi) Y.i ^ 5 2 < 00, 



i=l 
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(iv) EY 2 /(EY) 2 < exp(^ Ai^ 2 ) +o(l) as n -► oo. 



Then 1" > lo(u)EY a.a.s. 



The probability space Q n we are working with is the set of bipartite (multi) graphs obtained by taking d 
random perfect matchings between two sets V\ and V2 of n vertices each. This probability space is contiguous 
with a uniformly random d- regular graph (see the note after the proof of Ifl6l Theorem 4]), and hence, once 
we have verified the hypotheses of the theorem, Y > is a.a.s. in the uniform model as well (as well as various 
other models contiguous to it) . 

Let 

Y = \-i*+P) n z^ (16) 

be the number of independent sets with an vertices in V\ and fin in V2. Let the variable Xi be the number of 
cycles in the graph of length i. (We will apply the theorem only for even integers i, which is valid by a trivial 
change of notation.) 

Theorem 7.2 Let a and (3 be such that the conclusion of Theorem \3.3\ holds. Then the random variables Y defined 
in (fT6l ) and X{ defined by the number of cycles of length i (for even i), satisfy the conditions ofTheorem \7.1\ 

Proof of Theorem \3Al By TheoremEland Theorem [77T]it follows that a.a.s. we have Y > ^EY. This clearly 
implies that 

Z a / > -EZ a / 
n G 

as needed. □ 



An alternative valid conclusion of the theorem (not as quoted above) in this application, is that, if we 
sample from the random graphs with weight proportional to the number of independent sets, then the model 
of random graphs we get is contiguous to the original: events that are a.a.s. true in one model are also a.a.s. 
true in the other. 

We now prove Theorem l7.2| 



Lemma 7.3 Condition (i) holds with 

K = T -^- (17) 

i 

where r(d, 1) is the number of ways one can properly edge color a cycle of length i with d colors. 



Proof: This follows using the standard techniques (e.g. see Bollobas book or [22]). The reasoning goes as 
follows. There are asymptotically ri 1 /i positions for the cycle to be in, and, given the perfect matchings that the 
edges of a given cycle belong to (r(d, i) possibilities), the probability it occurs is easily seen to be asymptotic to 

n~\ □ 



For part (ii), as usual we do a calculation for E(Y Xj) /E Y which determines 5, L , and observe that the same 
calculation is easily extended to the arbitrary moments required for verifying this part of the theorem with the 
value of 5 so obtained. 



Lemma 7.4 

where 



St 



(1 - a y/ 2 (1-py/ 2 ' 
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Proof: Note that 

E (^) = EE P ^,rA^) 

S,T C 

where S and T denote subsets of V\ and V 2 of sizes an and fin respectively, C denotes a possible position of a 
cycle (not joining any vertices of S and T), A S: t is the event that the random graph has SUT as an independent 
set, and A c the event that it contains C. Similarly, 

E(Y)=Y,P(Mt). 

S,T 



We proceed with i even. We have 



, 2 



where 



• The leading factor 1/i accounts for the fact that we will count cycles rooted at a vertex in V\ (which can be 

done in i/2 ways) and oriented (2 ways), 

• S and T denote subsets of V\ and V 2 of sizes an and (5n respectively, 

• £ denotes a proper (i-edge-coloured rooted, oriented i-cycle (r(d,i) possibilities), in which the vertices are 

2-coloured, black and white, with no two black vertices adjacent. The color of the edges will prescribe 
which of the d perfect matchings an edge of a (potential) cycle will belong to. The black vertices will 
prescribe which of the cycle vertices are members of S U T. 

• n denotes a position that an i-cycle can be in (i.e. the exact vertices it traverses, in order) such that prescrip- 

tion of the vertex colors of £ is satisfied, 

• Pi is the probability that a random graph in £1 contains a cycle C in the given position n with the edge colors 

prescribed by £ in accordance with which matchings contain the edges of C, 

• P 2 is the conditional probability that the random graph respects S U T as an independent set, given that it 

contains C as in the definition of P\ . 

Since all quantities concerned are independent of S and T (provided they have the correct cardinalities), 
we can fix S and T and write 

E(YXj) _ 1 vv PiP 2 

e v 

where P 3 is the probability that the random graph respects S U T as an independent set. 

As noted before, P3 is the probability that there are no edges between T and S in each of the d matchings. 
In other words, 

[(1 ~ P)n]a n X d 
\p\ an 

For £ = 1 and 2 let je(0 denote the number of black vertices in the coloring prescribed by £ that lie in V*. Next 
we show that P 2 asymptotically depends only on ji- For k = 1, . . . , d, let e(k) denote the number of edges of 
color k in £. Let fi(k) denote the number of edges of color k adjacent to black vertices of 5 and f 2 (k) denote 
the number of edges of color k adjacent to black vertices in T. Then given £ that contains no edges connecting 
S and T, the probability that S, T is an independent set is given by: 

JL [(l-j3)n-e{k) + h(k)] {an _ h{k)) 



k=i 



[n - e(k)} {an _ fl{k)) 
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In order to calculate the asymptotics of P2 / P3 we observe that 



ni 



((1 - a )n)Mk) N.n ^ ((1 _ a)n)Mk){ 1 )<k) 

[n-e{k)\ (an-h (&)) F-e(fc)Jan 1-a 



and similarly 



[(1 - /3)n - e(k) + / 2 (&)](an-/i(fc)) 



((l-a-/?)n)^W( 



h(k)( l -f ) \e(k)-f 2 (i) 



1-a- 0' 



Therefore 



P> 



((l_ a ) w )E>/iW(^_)E < e(fc)((i_ a - ; g) w )-E < A(fc)( 1 I g )E fc / 2 W-e(fc) 



P3 v v v 1 — a ' v 1 — a — /? 

Recalling that J2k /iW = 2 iii Sfc = 2 J2 and £ fc e(fc) = i we obtain that 

P 2 (1 - a - /3)i-2ii-2j 2 



P 3 (l-a^-^Cl-zS)*- 2 ^" 



Clearly 



and the number of terms in the summation over 77 for which £, T may be an independent set (i.e. number of 
possible i]) is asymptotic to 

Thus (fl~8l) is asymptotic to 

- E "<,r.,,-'V'-- ,: ' (19) 



where aij u j 2 is the number of possible £ of length i with black vertices in Vg (£ = 1 and 2), and 



l-a-/3 



V(i -a)(i-py 



a(l — a) 
(1-a - f5f 



1(1-/3) 
(1-a-/?) 2 ' 



Define the matrix 

1 
y 1 
10 
z 1 

Each entry of A refers to a transition from one state to the next as we traverse the cycle C. The first row and 
column refer to a black vertex in V\, the second to a white vertex in V\, the third to a black vertex in V2, and 
the fourth to a white vertex in V2 ■ Then the trace of A 1 counts the possible £ weighted by y jl z^ 2 , except for the 
edge coloring, of which there are r(d,i) possibilities. Hence (fl~8l) is asymptotic to r(d, i)x l tr(A l )/i, with x, y 
and z defined as above. Letting /x^, k = 1 . . . 4 denote the eigenvalues, a little computation gives 



Mi 



M2 = 2 ( M + V 7 " 2 - 4v j . 



M4 



where u = 1 + y + z and u = yz. Substituting the values of y and z gives 



Ml 



a/3 



(l-a-/3) 2 ' 



2 _ (l-a)(l-/3) 

^3- (l_ a _/3)2 ' 
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Thus, recalling that i is even, and ti(A l ) = 2fi\ + 2/4, we have from (1181) 

E(YXi) r(d,i)x* a 1 / 2 ^ 2 r(d,i)j (1 - a)*/ 2 (l - /3)^ 2 
E7 ~ i (l-a-^i + i (l-a-/?)* 

r(d,«) a*/ 2 /?/ 2 r(d,i) 



Now ([171) gives that in (fT5l) . 



for even i > 2. □ 



i (l-a)*/2(i_/?)^ 
aV 2 /^/2 



+ 



(1-^/2(1-^/2- 



Verification of (ii) for arbitrary sequences mi,... is based on a straightforward extension of the above 
argument which we sketch briefly. 



Lemma 7.5 For every finite sequence mi, ... , m^ of nonnegative 

k 

E7 



E(y[x 2 ] ^[x 2fc]m j ^ ^ (At(i + 



i=l 

Proof: As in the previous case 

E(Y[X 2 ] mi ---[X 2k ] mk ) = J2 E P(^,tALi^), 

S^T C\ ,. . .,C r 

where r = Ym=i m i an ^ i R trie sum Ci, . . . , C mi are different cycles of length 2,C mi+ i, . . . , C mi+m2 are different 
cycles of length 4 etc. It is easy to see that the contribution to the sum coming from the cases where two of the 
cycles intersect is o(l). Therefore it suffices to consider disjoint cycles. 

We now repeat the previous argument where 77 and £ will refer to r disjoint cycles. We then obtain the 
same formula for P2/P3, where now i is the total length of the cycles, j\ is the total number of black vertices 
in V\ covered by cycles and j 2 is the total number of black vertices in V 2 covered by cycles. Finally in order to 
evaluate the sum corresponding to ( fl~9l ), we note that it factorizes as a power of the sums for individual cycles. 

□ 



Lemma 7.6 

Aj 5 2 < 00 

even i>2 

and 

exp( Y, htf)=r a ^(d). 

even i>2 

Proof: Finding r(d, i) is a well-known problem; one can for example solve the recurrence 

r(d, i) = d{d - I) 1 ' 1 - r(d, i - 1) 

to obtain 

r{d,i) = (d-l)* + (-l)*(d-l) 
and again we only pay attention to i even. 
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We now have 

£ **» = £ T«"-^ + C-'))( (1 - tt ^-J 

even i>2 even i>2 vv /v 

~ '((l-«)(l-fflj +( ''" 1)<, ((l-«)(l-ffl 
where = -|(ln(l - s) + ln(l + x)) = -|(ln(l - x 2 )). We find that 

{ (d-l) 2 a 2 {3 2 _ {l-a-(3 + da(3)(l-a-(3-(d-2)af3) 



1 



and hence 
as calculated in Theorem l3.31 □ 



(l-a) 2 (l-/3) 2 (l-a) 2 (l-/?) 2 
a 2 f3 2 (1-a- (3)(l-a- p + 2a(3) 

(l-a) 2 (l-/3) 2 ~ (l-a) 2 (l-/?) 2 ' 



Proof of Theorem I7.2t Part (i) of the Theorem holds by Lemma [73l Part (ii) holds by Lemma [731 and parts 
(iii) and (iv) of Theorem 17.11 hold by Lemma 17.61 So a.a.s. a random graph has independent sets S and T 
counted by Y. 

□ 
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